Finite Element Integration with Quadrature on the GPU

نویسندگان

  • Matthew G. Knepley
  • Karl Rupp
  • Andy R. Terrel
چکیده

We present a novel, quadrature-based finite element integration method for low-order elements on GPUs, using a pattern we call thread transposition to avoid reductions while vectorizing aggressively. On the NVIDIA GTX580, which has a nominal single precision peak flop rate of 1.5 TF/s and a memory bandwidth of 192 GB/s, we achieve close to 300 GF/s for element integration on first-order discretization of the Laplacian operator with variable coefficients in two dimensions, and over 400 GF/s in three dimensions. From our performance model we find that this corresponds to 90% of our measured achievable bandwidth peak of 310 GF/s. Further experimental results also match the predicted performance when used with double precision (120 GF/s in two dimensions, 150 GF/s in three dimensions). Results obtained for the linear elasticity equations (220 GF/s and 70 GF/s in two dimensions, 180 GF/s and 60 GF/s in three dimensions) also demonstrate the applicability of our method to vector-valued partial differential equations.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Efficiency of Anti-Hourglassing Approaches in Finite Element Method (TECHNICAL NOTE)

one of the simplest numerical integration method which provides a large saving in computational efforts, is the well known one-point Gauss quadrature which is widely used for 4 nodes quadrilateral elements. On the other hand, the biggest disadvantage to one-point integration is the need to control the zero energy modes, called hourglassing modes, which arise. The efficiency of four different an...

متن کامل

Free Vibration of Functionally Graded Cylindrical Shell Panel With and Without a Cutout

The free vibration analysis of the functionally graded cylindrical shell panels  with and without cutout is carried out using the finite element method based on a higher-order shear deformation theory. A higher-order theory is used to properly account for transverse shear deformation. An eight noded degenerated isoparametric shell element with nine degrees of freedom at each node is considered....

متن کامل

Convergence Analysis of a Quadrature Finite Element Galerkin Scheme for a Biharmonic Problem

A quadrature finite element Galerkin scheme for a Dirichlet boundary value problem for the biharmonic equation is analyzed for a solution existence, uniqueness, and convergence. Conforming finite element space of Bogner-Fox-Schmit rectangles and an integration rule based on the two-point Gaussian quadrature are used to formulate the discrete problem. An H2-norm error estimate is obtained for th...

متن کامل

Generalized Gaussian Quadrature Rules for Discontinuities and Crack Singularities in the Extended Finite Element Method

New Gaussian integration schemes are presented for the efficient and accurate evaluation of weak form integrals in the extended finite element method. For discontinuous functions, we construct Gauss-like quadrature rules over arbitrarily-shaped elements in two dimensions without the need for partitioning the finite element. A point elimination algorithm is used in the construction of the quadra...

متن کامل

Generalized Gaussian Quadrature Rules in Enriched Finite Element Methods

In this paper, we present new Gaussian integration schemes for the efficient and accurate evaluation of weak form integrals that arise in enriched finite element methods. For discontinuous functions we present an algorithm for the construction of Gauss-like quadrature rules over arbitrarily-shaped elements without partitioning. In case of singular integrands, we introduce a new polar transforma...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1607.04245  شماره 

صفحات  -

تاریخ انتشار 2016